Confidence intervals for the population mean tailored to small sample sizes, with applications to survey sampling.

نویسندگان

  • Michael A Rosenblum
  • Mark J van der Laan
چکیده

The validity of standard confidence intervals constructed in survey sampling is based on the central limit theorem. For small sample sizes, the central limit theorem may give a poor approximation, resulting in confidence intervals that are misleading. We discuss this issue and propose methods for constructing confidence intervals for the population mean tailored to small sample sizes. We present a simple approach for constructing confidence intervals for the population mean based on tail bounds for the sample mean that are correct for all sample sizes. Bernstein's inequality provides one such tail bound. The resulting confidence intervals have guaranteed coverage probability under much weaker assumptions than are required for standard methods. A drawback of this approach, as we show, is that these confidence intervals are often quite wide. In response to this, we present a method for constructing much narrower confidence intervals, which are better suited for practical applications, and that are still more robust than confidence intervals based on standard methods, when dealing with small sample sizes. We show how to extend our approaches to much more general estimation problems than estimating the sample mean. We describe how these methods can be used to obtain more reliable confidence intervals in survey sampling. As a concrete example, we construct confidence intervals using our methods for the number of violent deaths between March 2003 and July 2006 in Iraq, based on data from the study "Mortality after the 2003 invasion of Iraq: A cross sectional cluster sample survey," by Burnham et al. (2006).

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Area specific confidence intervals for a small area mean under the Fay-Herriot model

‎Small area estimates have received much attention from both private and public sectors due to the growing demand for effective planning of health services‎, ‎apportioning of government funds and policy and decision making‎. ‎Surveys are generally designed to give representative estimates at national or district level‎, ‎but estimates of variables of interest are oft...

متن کامل

Precision of systematic and random sampling in clustered populations: habitat patches and aggregating organisms.

Natural populations of plants and animals spatially cluster because (1) suitable habitat is patchy, and (2) within suitable habitat, individuals aggregate further into clusters of higher density. We compare the precision of random and systematic field sampling survey designs under these two processes of species clustering. Second, we evaluate the performance of 13 estimators for the variance of...

متن کامل

Confidence Intervals for Lower Quantiles Based on Two-Sample Scheme

In this paper, a new two-sampling scheme is proposed to construct appropriate confidence intervals for the lower population quantiles. The confidence intervals are determined in the parametric and nonparametric set up and the optimality problem is discussed in each case. Finally, the proposed procedure is illustrated via a real data set. 

متن کامل

Jackknife and Bootstrap Methods for Variance Estimation from Sample Survey Data

Re-sampling methods have long been used in survey sampling, dating back to Mahalanobis (1946). More recently, jackknife and bootstrap resampling methods have also been proposed for small area estimation; in particular for mean squared error (MSE) estimation and for constructing confidence intervals. We present a brief overview of early uses of resampling methods in survey sampling, and then pro...

متن کامل

Outer and Inner Confidence Intervals Based on Extreme Order Statistics in a Proportional Hazard Model

Let Mi and Mi be the maximum and minimum of the ith sample from k independent sample with different sample sizes, respectively. Suppose that the survival distribution function of the ith sample is F ̄i = F ̄αi, where αi is known and positive constant. It is shown that how various exact non-parametric inferential proce- ′ dures can be developed on the basis of Mi’s and Mi ’s for distribution ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • The international journal of biostatistics

دوره 5 1  شماره 

صفحات  -

تاریخ انتشار 2009